(19) 



J 



(12) 



(88) Date of publication A3: 

21.04.1999 Bulletin 1999/16 

(43) Date of publication A2: 

17.12.1997 Bulletin 1997/51 

(21) Application number: 97109680.5 

(22) Date of filing: 13.06.1997 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets (1 1 ) EP 0 813 158 A3 

EUROPEAN PATENT APPLICATION 

(51) Int. CI. 5 : G06F 17/30 



(84) 


Designated Contracting States: 


(72) 


Inventor: Spencer, Graham 




AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 




Cupertino, California 95014 (US) 




NL PT SE 










(74) 


Representative: Liesegang, Eva 


(30) 


Priority: 14.06.1996 US 661335 




Forrester & Boehmert, 








Franz-Josef-Strasse 38 


(71) 


Applicant: Excite, Inc. 




80801 Munchen (DE) 




Mountain View, California 94043 (US) 







(54) System and method for accelerated query evaluation of very large full-text databases 



(57) A system, method, and various software prod- 
ucts provide for improved information retrieval in very 
large document databases through the use of a prede- 
termined static cache. The static cache includes for 
terms that appear in a large number of documents, a 
plurality of documents ordered by a contribution that the 
term makes to the document score of the document. 
The contribution is a scalar measure of the influence of 
the term in the computed document score. The contri- 
bution reflects both the within document frequency and 
the between document frequency of the term. In addi- 
tion, the static cache includes for each term a lookup 



table that references selected entries for the term in an 
inverted index. Queries to the database are then proc- 
essed by first traversing the static cache and obtaining 
the contribution information thereform and computing 
the document score from this information. Additional 
term frequency information for other terms in the query 
is obtained by looking up the document in the lookup 
tables of the other query terms, and obtaining the term 
frequency information for such terms from the inverted 
index, or by searching the contribution caches of the 
query terms. 
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